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ABSTRACT 

The Kepler survey has provided a wealth of astrophysical knowledge by continuously 
monitoring over 150,000 stars. The resulting database contains thousands of examples 
of known variability types and at least as many that cannot be classified yet. In or¬ 
der to reveal the knowledge hidden in the database, we introduce a new visualisation 
method that allows us to inspect regularly sampled time series in an explorative fash¬ 
ion. To that end, we propose dimensionality reduction on the parameters of a model 
capable of representing time series as fixed-length vector representation. We show 
that a more refined objective function can be chosen by minimising the reconstruc¬ 
tion error, that is the deviation between prediction and observation, of the observed 
time series instead of reconstructing model parameters. The proposed visualisation 
exhibits a strong correlation between the variability behaviour of the light curves and 
their physical properties. As a consequence, temperature and surface gravity can, for 
some stars, be directly inferred from non- (or quasi-) periodic light curves. 

Key words: techniques: photometric - astronomical data bases: miscellaneous - 
methods: data analysis - methods: statistical. 


1 INTRODUCTION 


The launch of the Corot and Kepler spacecrafts (Auvergne 


et al. 2009 and Borucki et al. 2010 \ initiated a new era in 


the study of stellar variability. The unobstructed view to the 
light of astronomical sources enabled the continuous mon¬ 
itoring of stellar sources with short cadence and very low 
photometric error. While the primary goal of both missions 
was the detection of solid exoplanets, the continuous moni¬ 
toring of 150, 000 variable main sequence stars allowed a very 
detailed study of their variability behaviour. While astroseis- 
mology ( Gilliland et al.|2010 | and exoplanet detection (Lis- 
sauer et al. 20121 greatly benefit from those observations, 


many objects remain unlabelled. The labelling of (quasi-) 
periodic sources is quite reliable (e.g., Benko et al. 2010[ 


Slawson et al.|2011|, however, a large fraction of the objects 


that show no periodic behaviour remains unclassified. 

In order to investigate the nature of the objects that 
cannot be explained by known variability mechanisms, al¬ 
ternative approaches have to be found. Visualisation (i.e. di¬ 
mensionality reduction) and clustering are the most promi¬ 
nent representative algorithms from the camp of unsuper¬ 
vised learning. Visualisation in particular allows an intuitive 
inspection of the properties of the observed data by project¬ 
ing them in a lower dimensional space | Kramer et al.|20T3 1. 
Data analysts can interpret the visualisation plot and look 
for structures and similarities that could not be detected in 
the original data space. However, it is not possible to directly 
apply dimensionality reduction to raw sequential data as the 
individual measurements are not independent and therefore 


the time series cannot be treated like vectorial data. To cir¬ 
cumvent this problem, [Matijevic et al.| ( |2012[ ) apply local lin¬ 
ear embedding to light curves of binary stars that have been 
previously phase-folded and pre-aligned. Alignment is only 
possible if the light curve is periodic and a uniquely identifi¬ 
able point exists (e.g. point of deepest eclipse for binaries), 
with respect to which each light curve of a given class can be 
aligned. However, besides phase-folding and alignment, the 
main issue still persists namely that the sequential nature of 
the data has been ignored. This implies that the dynamical 
behaviour of the physical systems will not be (adequately) 
captured in the visualisation plots. 


In this work, we use the echo state network (ESN, [Luko-| 
sevicius fc Jaeger|2009 1 to describe time series as sequences. 


The ESN is a discrete time recurrent neural network that 
is used to forecast the behaviour of time series. The ESN, 
as other neural networks, is parametrised by a vector of 
weights. Training the ESN on a time-series yields an op¬ 
timised vector of weights which in this work we use as a 
fixed-length vector representation for regularly sampled time 
series. The advantage of this new representation is that it is 
invariant to the variable length of time series as well as to 
the presence of time shifts. Instead of performing the visual¬ 
isation on the original data items, we perform visualisation 
on this new representation. This amounts to coupling the 
visualisation algorithm to the ESN model and thereby ob¬ 
taining a more meaningful visualisation that does take into 
account the time behaviour of the time series data. With 
this tool at hand, we visualise a dataset of light curves from 
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the Kepler survey and highlight the meaning of this visn- 
alisation in the context of the physical properties of these 
stars. 

In Section]^ the proposed methodology is explained in 
more detail, while Section |3]describes the Kepler survey. The 
results of the visualisation are presented in Section m fol¬ 
lowed by a discussion with respect to physical properties in 
Section [S] Finally, the prospects of the presented method¬ 
ology in the analysis of time series and other astronomical 
data are discussed in Section]^ 


2 METHODOLOGY 


In the following, we provide a brief description of the ESN- 
based visualisation method. A more detailed account of the 
adopted methodology in this work has previously appeared 


Gianniotis et al. (20151. The terms time-series and se¬ 


quence are used interchangeably. 


2.1 Encoding sequences as readouts 


It is evident that dimensionality reduction designed for vec¬ 
torial data cannot be directly applied on time series as this 
would not correctly take into account their sequential na¬ 
ture. Therefore, prior to dimensionality reduction, an ap¬ 
propriate vectorial representation of time-series needs to be 
found that is able to deal with variable lengths and shifts 
along the time axis. To that purpose, we employ the ESN 
architecture ( Lukosevicius fc Jaeger|2009 1. An ESN is a dis¬ 
crete time recurrent neural network that learns to predict 
on regularly sampled time series: given an observation yt at 
time t, it makes a prediction for yt+i- ESNs have the great 
advantage that, in contrast to other neural network archi¬ 
tectures, the hidden non-linear part (known as reservoir) is 
hxed and only the output weights need to be trained. The 
output weights, the so called readouts, interact linearly with 
the reservoir and are the only free parameters in the model. 
Hence, the ESN may be viewed as a linear model written 
as yt+i ~ (f){yt)w, where w £ are the readouts and 

<p{yt) £ R'^^^ are the activations of the hidden non-linear 
part induced by input yt- (j) encodes various aspects of the 
neural network architecture of the ESN (number of neurons 
h, connectivity, weight structure, etc.) which we do not dis¬ 
cuss here; instead we refer the interested reader to |Gianniotis| 
et al. (20151 and references therein. 


We use the ESN to encode regularly sampled time series 
as a fixed-length vector representation. We use the notation 
y to denote an entire light curve y = (j/i,... ,yT) £ R^^^ 
composed of individual observations yt £ R. Also, we use 
$ € R^’^^ as the matrix that accumulates row-wise all ac¬ 
tivations 4>(yt)- Hence, given a sequence y, the optimised 
readouts are found by solving the least squares problem 
lly —The obtained readout w is taken as the new rep¬ 
resentation for a light curve y. Given a dataset of N number 
of light curves {y ^^^, ■ ■ ■, the first step in our approach 

is to encode each light curve as a readout vector. This results 
to a new dataset of N readout vectors ..., 



2.2 Dimensionality reduction 


Dimensionality reduction algorithms seek to find low¬ 
dimensional representations a; £ R'^ of high-dimensional 
data items w G R"*, where q < d, so that they can be vi¬ 
sually inspected, i.e. g = 2 or g = 3. In this case, w stands 
for the readout weights returned by the ESN. A typical cri¬ 
terion that drives the training of visualisation algorithms is 
the reconstruction error: one attempts to reconstruct from 
the low-dimensional representations x the original vector w. 
Of course, due to the information loss incurred during the 
dimensionality reduction, the reconstruction is only approx¬ 
imate and we obtain reconstructed readouts w. A typical 
choice for the reconstruction error is the squared L 2 norm, 
||iy — 


For instance, principal component analysis (PGA, 


Hotelling 19331 projects w linearly to a low-dimensional 


space spanned by the top two eigenvectors ui,U 2 of the sam¬ 
ple covariance matrix. In this case, the linear projection is 
given by matrix U = [ui U 2 ] £ the lower dimensional 

representation reads x = (wU)'^, and the reconstruction 
reads w = (Ux)'^. Hence, the reconstruction error of PGA 
is given by \\w — UU'^w\\^. 

Another dimensionality reduction algorithm, often 
viewed as a kind of non-linear PGA, is the autoencoder 
(Kramer 19911. The autoencoder may be viewed as the com¬ 
position of two functions, the projection to low-dimensions 
denoted by fenciw) = x (the non-linear analogue of U), and 
the reconstruction to the high-dimensional space denoted by 
fdec{x) = w (the non-linear analogue of U'^). Hence, the 
complete mapping is a function f{w, 9) = fdecifenci'^)) ~ 
w, where 0 are the free parameters, the weights of the au¬ 
toencoder. The autoencoder’s reconstruction error is given 
by |]iy — /(iy;0)|p which is the analogue to the PGA ob¬ 
jective Ijio — UU'^wW'^. 


2.3 ESN-coupled autoencoder 


So far, the presented algorithms measure how good the re¬ 
construction between the original weight w and the recon¬ 
structed weights w is in terms of their L 2 distance. A bet¬ 
ter measure would be to check how well the reconstruction 
w still represents its corresponding sequence y. This can 
be simply checked by looking at the reconstruction error 
\\y — ^w\\‘^ measured with respect to the sequence y, as 


proposed in Gianniotis et al. (20151. This measure is much 


more meaningful as it measures reconstruction in terms of 
the prediction quality of the underlying ESN model. This 
suggests modifying the reconstruction error of the autoen¬ 
coder so that it now reads: 


\\y - ^f{w, 0)f = \\y - = \\y - y\f 
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Visualisation Linear Objective 

PCA Yes ||in — iu|P = ||in — UU'^w\\'^ 

Plain autoencoder No ||io — wW^ = ||u; — f{w\ 0)|P 

ESN-AE No \\y - i/|p = \\y - ^f[w,e)\\^ 

Table 1. Summary of proposed visualisation algorithms. 


Essentially, the above reconstruction error entails an au¬ 
toencoder that is coupled to an ESN as illustrated in Fig. 
the echo state network is used to encode light curves y as 
readouts w. Next, the readouts are compressed to a low 
dimensional representation x. Out of the low dimensional 
representation we then reconstruct a readout w which when 
plugged back to the ESN should give a reconstruction y 
for the input sequence y. Optimisation of the autoencoder 
follows by gradient optimisation via the backpropagation al¬ 
gorithm ( |Bishopjl996P as typically done for neural networks. 
We term the proposed visualisation approach as the ESN- 
coupled autoencoder which we abbreviate here as ESN-AE. 


2.4 Summary of employed models 

We briefly summarise the concepts presented in the preced¬ 
ing sections. The basic idea of the presented approach is to 
find a suitable vector representation of the time series data. 
Therefore, an ESN is trained on each light curve y and the 
resulting readout w is used as a fixed-length vector repre¬ 
sentation. 

The standard way of reducing the dimensionality of vec¬ 
tors is to find representations that minimise the L 2 norm 
between the original vectors and their reconstructions. This 
can be done either in a linear (PCA) or non-linear (autoen¬ 
coder) fashion. However, minimising |]ro — mlp is problem¬ 
atic as it does not capture in any way how well the recon¬ 
structed readout w still represents the corresponding light 
curve y. On the other hand, in the proposed ESN-AE model 
we put forward an alternative reconstruction that measures 
how well a reconstructed readout weight ib can still predict 
the original light curves y it represents. In order to distin¬ 
guish the ESN-AE model from the autoencoder, we hence¬ 
forth call the later one “plain autoencoder”. We summarise 
these concepts in Tab. [T] 


3 DATA 


The data used in this work come from the Kepler satellite 


mission described in detail in Borucki et al. (20101. The 


Kepler mission aims for the detection of Earth-like planets 
and therefore continuously monitors 150, 000 main sequence 
stars in a fixed 115 square degrees field of view located be¬ 
tween Cygnus and Lyrae. Besides the detection of exoplan¬ 
ets, the mission opened the window to a detailed study of the 
dynamical behaviour of variable stars (e.g., RR Lyrae stars). 
Sources were observed with different cadences over several 
quarters. In this work, we focus only on data from the first 
quarter with long exposures of 29.4 min each (33.16 days in 
total). The available data volume, even in the chosen subset, 
is quite large as the first quarter comprises already 240 mil¬ 
lion photometric measurements. In order to save computing 
time and focus on objects that have not been classified be¬ 
fore, we limit ourselves to objects that are unlikely to be 


periodic by choosing only objects with Pfl > 0.5 from the 


catalogue in Debosscher et al. (20111. In order to include 


only objects that show considerable variability we select ob¬ 
jects with 

std{F) > 3{AF) 

where F is the pre-search data conditioning simple aperture 
photometry (PDCSAP, [Smith et al.|2012 l flux of the object 
and (AF) is the average photometric error. The light curves 
were then preprocessed by taking the logarithm (base 10) 
of the flux and subtracting the median of it. In order to get 
rid of individual spikes, caused by cosmic ray hits, we go 
through the values yt of each light curve and replace them, 
using a rather conservative constraint, according to: 


yt 


if - yt\ > MAD)^{\yt+l - yt\ > MAD) 
yt, otherwise 


where 


= median{{yt+j\j G [-10,10]}) 


and MAD is the median absolute deviation for the en¬ 
tire light curve y. On average, 150 of the 1624 observa¬ 
tions have been replaced per time series. After the replace¬ 
ment of spikes, three further light curves had to be ex¬ 
cluded: KIC4902072 appears to exceed the numerical range 
and therefore its amplitude is occasionally swapping signs; 
KIC4346303 shows dramatically higher amplitude and was 
removed in order to avoid introducing a bias in the visuali¬ 
sation; KIC6117602 still shows some heavy spikes after the 
spike removal which are probably of an instrumental origin 
(damaged pixel) as well. Hence, the final dataset was com¬ 
posed of 6,206 Kepler light curves. 


4 RESULTS 


In Fig. [^ we display the visualisations obtained by the ESN- 
AE, PCA and a plain autoencoder. We mention in pass¬ 
ing that the parameter configuration used for the proposed 
method was an ESN with a hidden reservoir of 50 neurons 


using the cycle architecture (Rodan & Tiho 20121 coupled 


to an autoencoder with a hidden layer of 10 neurons. More 
information on the parameters, can be found in |Giannio^ 
tis et al. (20151. For the plain autoencoder, we also used a 


hidden layer of 10 neurons. 

The visualisations in Figs. |2b| |2c| are driven by opti¬ 
mising the L 2 norm on the readout representations which 
means that what they display is a two-dimensional projec¬ 
tion of the readouts w. On the other hand, the ESN-AE 
in Fig. is driven by optimising reconstruction on the se¬ 
quences y, and hence what we obtain is a two-dimensional 
projection of the sequences. Though the readout parameters 
w do capture important information about their respective 
sequences, we do not expect in general that their visualisa¬ 
tion will impart a meaningful result. In other words, PCA 
and the plain autoencoder jndge two light curves to be sim¬ 
ilar, if their respective readouts are similar in the L 2 sense. 
The ESN, however, judges two sequences to be similar if 
their corresponding readouts result in similar sequences. 

We begin our analysis with the ESN-AE in Fig. Af¬ 
ter inspecting this visualisation we noticed that there are. 
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AE component 1 


(a) ESN-AE. 



0 5 

PCA component 1 

(b) PCA on ESN weights. 



AE component 1 


(c) Plain autoencoder on ESN weights. 


Figure 2. Visualisation of Kepler light curves using a coupled autoencoder (a), a principal components analysis (b) and a plain 
autoencoder (c). The visualisations are performed on the weights originating from the ESN. The division into the classes is based on the 
ESN-AE visualisation. For the sake of clarity, only 60% of the points are plotted. 
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Figure 3. Example light curves drawn from Fig.[^ The x-axis 
corresponds to 33.16 days for each of the images, ticks have been 
removed for clarity. The y-axis denotes the logarithm of the PDC- 
SAP flux with an arbitrary offset and scaling for better visibility. 



-4 


-2-101234 
AE component 1 


Figure 4. Autoencoder visualisation with overplotted known ro¬ 
tational stars from [McQuillan et al.| l |2014[ |. Additionally, giant 
stars as obtained by the temperature-log ^-criterion in |Hekker| 
jet al.| l |20TTl l are shown. For the sake of clarity, only 60% of the 
points are plotted. 


roughly speaking, four regions of particular patterns which 
we highlight with four different colours. In every coloured 
region, we have marked three randomly chosen light curves, 
which we display in Fig. It turns out that the four re¬ 
gions show distinct variability behaviour. The yellow branch 
shows periodic, dip-like variability, fairly common for rotat¬ 
ing stars, while the cyan curves show a prominent variability 
on time scales of roughly ten days. The red curves are noisy 
with small-amplitude and short-term variability, while the 
green curves show a low-amplitude variability as well but 
on longer time scales. We note that the red and green class 
are rather similar in their appearance. We also show how the 
behaviour of sequences smoothly changes variability regimes 
as we cross the borders of these four regions. For instance, 
we note how the yellow sequence (1-2-3) transits from short 
to long term behaviour as we traverse from the edge of the 
yellow region towards the border with the blue region. A sim¬ 
ilar behaviour is observed for the red sequences, as we again 
traverse this region from sequence (1) towards sequence (3) 
close to the blue and yellow border. This is of course not 
accidental, but rather by construction, as the ESN-AE is 
placing similar dynamical behaviour in similar locations. 

We use the same colours in the other two visualisa¬ 
tions, in order to show where each coloured region in Fig. 
is mapped in the PCA visualisation of Fig. |2b| and in the 
plain-autoencoder visualisation of Fig. We stress at this 
point that we are aware that our interpretation of the map is 
subjective, and that other ways of highlighting the visualisa¬ 
tion are possible. However, compared to the other visualisa¬ 
tions, we find it easier in Fig. to discern some structure. 
For instance, in Fig. |2b| we see that the readouts w look all 
rather similar to PCA as no distinct structure stands out. 
This could be perhaps attributed to the linear and inflexible 
nature of PCA, as the plain-autoencoder in Fig. shows 
projections organised in certain subgroups. 

With the selected example light curves at hand, we are 
able to reinvestigate the meaning of the visualisations. The 
transition from short to long term behaviour in the yellow 
sequence (1-2-3) to the cyan sequences is fairly clear and ap¬ 
parent in all visualisations, even though, the distances are 
judged quite differently between PCA, ESN-AE on one side 


and plain autoencoder on the other. Strong support for the 
ESN-AE and the plain autoencoder comes from the relative 
distance between red-3 and yellow-3 which is, compared to 
other distances, quite large in the PCA visualisation. How¬ 
ever, when inspecting the light curves we note a strong sim¬ 
ilarity between these two and attribute the high distance 
in the PCA to the inflexibility of the visualisation. Two 
light curves of the green sequence (2,3) show quite signih- 
cant long-term variability as opposed to the noise-dominated 
light curves in the red sequence (1,2). In the plain autoen¬ 
coder, the distances between those are judged significantly 
different to the PCA and ESN-AE visualisations. While this 
alone is not a strong argument, it is interesting to see that 
the noise-dominated light curve from the green sequence (1) 
is projected very far away from the noisy red ones, in terms 
of relative distances. 

Additionally, the visualisation of the plain autoencoder 
shows a significant overlap between the yellow, cyan and 
red light curves, even though, the variability behaviour of 
these three classes is inherently quite distinct. We also note 
that the plain autoencoder, does not enjoy the same smooth 
change in behaviour observed in the ESN-AE visualisation. 
This is very likely due to the missing link between recon¬ 
struction of weights and reconstruction of sequences; this 
hinders the plain autoencoder in correctly interpreting the 
weights w leading to a visualisation that cannot be easily 
comprehended. 

Given the above considerations and the principled ob¬ 
jective function, we argue that the ESN-AE model provides 
a better visualisation of the Kepler light curves. Of course, 
the evaluation of the visualisation is thereby not conclusive. 
In order to gain further insight as to how meaningful it is, 
we investigate in the next section how certain physical prop¬ 
erties relate to the ESN-AE visualisation. 

5 DISCUSSION 

The central point of this work is the visualisation of time 
series with respect to a given model, an ESN in this case. By 
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(b) Temperature 
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(c) Surface gravity 


Figure 5. Visualisation of the coupled autoencoder with overplotted properties from the KIC. A strong correlation between the visual¬ 
isation and the temperature as well as the surface gravity is apparent. For the sake of clarity, only 60% of the points are plotted. Due to 
the presence of a few extreme values, we also limit the colour scale to its present range to avoid colour saturation. 


design, the ESN-AE delivers lower reconstruction errors on 
time series of all models investigated in the previous section. 
The other algorithms were only used in order to highlight 
and discuss the impact of the coupling in the visualisation. 
Therefore, the discussion focuses solely on the visualisation 
results of this model. 

The periodic variability with dip-like occurrences (yel¬ 
low region) as well as the long quasi-periodic behaviour of 
the cyan region are rather typical signs of rotational stars. 
Fortunately, |McQuillan et alT] ( |2014[ ) have scanned the Ke¬ 
pler database for rotationally variable stars. In Fig. the 
location of the rotational stars found by them and being part 
of our sample are highlighted in red. One can see that nearly 
exclusively the cyan and the yellow regions are covered with 
rotationally variable stars, indicating that many of the stars 
in these two regions are rotationally variables as well. There¬ 
fore, the objects in these classes present high-fidelity candi¬ 
dates for rotational variability as well. It appears that espe¬ 
cially the yellow branch contains many formerly undetected 
objects which show similar dynamical behaviour to the ro¬ 
tating stars. We note that such a clear dependence between 
the visualisation and the location of the rotational stars is 
only visible in the ESN-AE visualisation. 

While the origin of most of the sources in the cyan and 
yellow region seems to be resolved, the red and the green 
regions do not exhibit variability behaviour which could 
uniquely identify their origin. Therefore, we inspected the 


data provided by Brown et al. (20111 in the Kepl er Input 
Catalog ( KICH). There, stellar model atmospheres ( [Castelli] 
& Kurucz 20041 were employed to photometric measure¬ 


ments of the Kepler sources in a probabilistic way. The 
obtained temperature is uncertain to ±200 K and the es¬ 
timate of the logarithm of the surface gravity is uncer¬ 
tain to 0.4 dea:. With those properties at hand, we highlight 
the giant stars in our sample, using the temperature-log p- 
criterion by Hekker et al. ( 2011| . 5969 of the visualised stars 
have a valid temperature and a valid \ogg, of which 13% 
(777) are giants. The giant stars are overplotted in green in 
Fig. I^as well. We note that they nearly exclusively populate 
the green region and are well separated from the remaining 
main sequence stars. 


In Fig. the individual properties from the KIC cat¬ 
alogue are overplotted for the visualised light curves. It 
is apparent (Fig. |5a[ ) that our visualisation does not cor¬ 
relate with metallicity. On the other hand, it has been 


noted in Brown et al. (20111 that the uncertainty of this 


estimate is rather high. In contrast to that, a clear corre¬ 
lation between our visualisation and the temperature ex¬ 
ists. Nearly all stars have temperatures between 4,000 and 
6,000 K (F- and G-type), but in Fig. |5b| a clear absence of 
hot or cold stars in the green region is apparent. Addition¬ 
ally, it seems that the hot stars are mainly located in the red 
and yellow branch, while cold stars can nearly exclusively be 
found in the bottom part of the cyan region. It should be 
noted, that the uncertainty in the temperature estimation 
is not sufficient to explain this separate behaviour. Further 
support for the visualisation comes from the distribution of 
surface gravities in Fig. which reflects that nearly all gi¬ 
ants are located in the green region. The correlation between 
variability behaviour and stellar properties, such as the sur¬ 


face gravity, has also been studied in Bastien et al. (20131. 


There, a correlation between the variability - that remains 
after subtracting the average brightness in 8 hour bins - and 


the surface gravity was discovered. In Huber et al. (20101, 


red giant variability was detected by investigating the cor¬ 
relation between frequency separation and the frequency of 
maximum power. In both publications, the goal was to de¬ 
tect giant stars based on their variability behaviour. To that 
end, explicit physical knowledge was invoked. It is therefore 
very pleasing to see that the proposed unsupervised method 
can also separate the giants based on their variability alone, 
without any knowledge about periodicity or other physical 
properties. 

So far, this work considered only the unsupervised 
task of dimensionality reduction of astronomical time se¬ 
ries. However, it is possible to extend it to supervised tasks 
such as classification or regression. We briefly make an ex¬ 
ample on classifying the giants previously identified in the 
visualisation. We trained two classifiers. The first one is a 
random foresIp] tr ained on all 29 features in the catalogue by 
Debosscher et al. (20111 yielding an accuracy of 96.0±0.4%. 


http: / / adsabs.harvard.edu/abs /2009yCat .5133... .OK 


^ We use a 1,000 trees with balanced class weights and a 3-fold 
cross-validation scheme. 
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The second one is also a random forest trained on the two 
visualisation components as inputs yielding a higher classi- 
hcation accuracy of 96.6 ± 0.2%. While we emphasise that 
this is not a conclusive statement on its own, it is remark¬ 
able to see that the classihcation on just the two visualised 
components performs as well as the one on the 29 (physi¬ 
cally motivated) features. The two visualisation components 
describe similarity in terms of the dynamical behaviour and 
appear to be highly informative. In addition, one can con¬ 
sider utilising the ESN readout weights w as features. These 
can then be used as inputs to a classifier with an objective 
(cross entropy) function that quantifies classihcation accu¬ 
racy rather than reconstruction error. 


6 CONCLUSIONS 

In this work, a new approach to visualise regularly sampled 
time series was presented. As opposed to visualisation al¬ 
gorithms in astronomy, the presented one does not require 
any pre-alignment of the data and respects the sequential 
nature of the time series. Besides that, it is capable to de¬ 
liver a shift invariant vector representation for sequential 
data of variable length. Compared to the common use of 
visualisation algorithms in astronomy, we do not employ 
the dimensionality reduction directly on the data, but on 
model parameters instead. We strongly advocate the use 
of sequential models to describe time series in astronomy 
and highlight the advantages of those using an ESN. The 
proposed ESN model returns a hxed-length vector represen¬ 
tation for a given sequence. This in turn can then be fed 
to a visualisation algorithm. In order to enhance the mean¬ 
ing of the visualisation, we measure the reconstruction error 
not in terms of reconstructing the model parameters but by 
measuring the direct implications on the reconstruction of 
the original light curves. This approach provides a power¬ 
ful objective function which also leads to a more meaningful 
visualisation. 

The proposed visualisation was demonstrated on a se¬ 
lected subset of 6, 206 light curves of variable stars. We stud¬ 
ied the quality of the plain and ESN-AE algorithms empiri¬ 
cally and concluded that the proposed coupled visualisation 
algorithm returns results that are easier to interpret. Fur¬ 
ther support for the proposed visualisation comes from the 
physical properties of the stars that have been derived from 
the time series data. With those, we can clearly see that the 
green cluster is mainly made up from giants with surface 
temperatures of 5,000 — 5, 500 K and a significantly lower 
surface gravity than the main sequence stars in the sample. 
Besides that, it appears that also separate regions are pop¬ 
ulated by hot (> 5,500K) and cold stars (< 5,500K). It 
is interesting that these physical properties (surface gravity, 
temperature) show up in the visualisation the way they do, 
as the underlying model is not aware of them. The corre¬ 
lation between physical properties and variability has been 
identihed in other works by explicitly looking for it using 
tailored features. The proposed visualisation confirms this 
correlation thus showing that these physical properties are 
inherent in the light curve dynamics. We speculate that the 
ESN readout representation could be further used in re¬ 
gression (e.g. predict surface gravity) and classification (e.g. 
main sequence versus giant stars) tasks. 


The presented approach is modular in the sense that 
parts of it can be simply replaced. The autoencoder is merely 
a convenient candidate but other visualisation algorithms 
could be used instead, perhaps with more favourable com¬ 
putational properties. Additionally, the underlying dynami¬ 
cal ESN model, could be replaced by other models capable 
of describing sequential data, such as auto-regressive models 
(ARMA). Finally, the approach is not limited to sequences 
and in principle other types of astronomical data, given a 
suitable model, can be visualised in the same fashion. Cur¬ 
rently, the visualisation of SDSS spectra using a blackbody 
model is investigated. 
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